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Abstract — Decoding of linear space-time block codes (STBCs) 
with spliere-decoding (SD) is well known. A fast-version of the 
SD known as fast sphere decoding (FSD) has been recently 
studied by Biglieri, Hong and Viterbo. Viewing a linear STBC 
as a vector space spanned by its defining weight matrices over 
the real number field, we define a quadratic form (QF), called 
the Hurwitz-Radon QF (HRQF), on this vector space and give 
a QF interpretation of the FSD complexity of a linear STBC. 
It is shown that the FSD complexity is only a function of the 
weight matrices defining the code and their ordering, and not 
of the channel realization (even though the equivalent channel 
when SD is used depends on the channel realization) or the 
number of receive antennas. It is also shown that the FSD 
complexity is completely captured into a single matrix obtained 
from the HRQF. Moreover, for a given set of weight matrices, 
an algorithm to obtain a best ordering of them leading to the 
least FSD complexity is presented. The well known classes of 
low FSD complexity codes (multi-group decodable codes, fast 
decodable codes and fast group decodable codes) are presented 
in the framework of HRQF. 

I. Introduction & Preliminaries 

Consider a minimal-delay space-time coded Rayleigh quasi- 
static flat fading MIMO channel with full channel state infor- 
mation at the receiver (CSIR). The input output relation for 
such a system is given by 



Y = HX + N, 
is the channel matrix and N G 



(1) 



where H G C""^""* is the channel matrix and JM G (U'"--'" is 
the additive noise. Both H and N have entries that are i.i.d. 
complex-Gaussian with zero mean and variance 1 and A^o 
respectively. The transmitted codeword is X G C"'^"* and 
Y G C"'^"* is the received matrix. The ML decoding metric 
to minimize over all possible values of the codeword X, is 



M(X) 



HX 



2 

Lf 



(2) 



Definition 1: A linear STBC [1]: A linear STBC C over a 
real (1 -dimensional) signal set S, is a finite set of rit x rit 
matrices, where any codeword matrix belonging to the code 
C is obtained from, 

K 

X{xi,X2, ■.■,xk) = ^XiAi, (3) 

i=l 

by letting the real variables xi,X2---,xk take values from a 
real signal set S, where A; are fixed rtf x rtf complex matrices 
defining the code, known as the weight matrices. The rate of 
this code is complex symbols per channel use. 



We are interested in linear STBCs, since they admit Sphere 
Decoding (SD) [2] which is a fast way of decoding for the 
variables. A further simplified version of the SD known as 
the fast sphere decoding (FSD) [3] (also known as conditional 
ML decoding) was studied by Biglieri, Hong and Viterbo. The 
quadratic form (QF) approach has been used in the context of 
STBCs in [4] to determine whether Quaternion algebras or 
Biquaternion algebras are division algebras, an aspect dealing 
with the full diversity of the codes. This approach has not been 
fully exploited to study the other characteristics of STBCs. 
In this paper, we use this approach to study the fast sphere 
decoding (FSD) complexity of STBCs (a formal definition of 
this complexity is given in Subsection III-BI ). 

Designing STBCs with low decoding complexity has been 
studied widely in the literature. Orthogonal designs with single 
symbol decodability were proposed in [5], [6], [7]. For STBCs 
with more than two transmit antennas, these came at a cost 
of reduced transmission rates. To increase the rate at the cost 
of higher decoding complexity, multi-group decodable STBCs 
were introduced in [8], [9], [10]. Fast decodable codes (codes 
that admit FSD) have reduced SD complexity owing to the fact 
that a few of the variables can be decoded as single symbols 
or in groups if we condition them with respect to the other 
variables. Fast decodable codes for asymmetric systems using 
division algebras have been recently reported [11]. Golden 
code and Silver code are also examples of fast decodable codes 
as shown in [12] and [13]. The properties of fast decodable 
codes and multi-group decodable codes were combined and 
a new class of codes called fast group decodable codes were 
studied in [14]. 

A. Hurwitz-Radon Quadratic Form 

In this subsection we define the Hurwitz Radon quadratic 
form (HRQF) on any STBC. We first recall some basics about 
quadratic forms. More details can be seen in [15]. 

Definition 2: Let be a field with characteristic not 2, and 
y be a finite dimensional F- vector space. A quadratic form 
on V is defined as a map Q : V — F such that it satisfies 
the following properties. 

. Q (av) = a^Q (v) for all v G F and all a G F. 

. The map B (v, w) = i [Q (v + w) - Q (v) - g (w)] for 
all V, w G is bilinear and symmetric. 

If we consider V as an n-dimensional vector space over F, 
then we can also consider the quadratic form as a homoge- 



neous polynomial of degree two, i.e., for I < i,j < n, we 
have scalars rriij such that 

n 

Qi'v) =Q{vi,V2,-.,Vn) = ^ iriijV.Vj (4) 

ij = l 

for all V = [I'l, Vn] € V. Hence, we can associate a matrix 
M = {niij) with the quadratic form such that Q (v) = vMv"^. 



Definition 3: The Hurwitz Radon quadratic form is a map 
from the STBC C = |x 
numbers M, i.e., Q : C — 



J2f=i^i^i^ to the field of real 



given by 



Q (X) = ^ XiXjdij, 

l<i<j<K 

where X is an element of the STBC and 



(5) 



A,A 



H 



-A, A 



H 



-^ij — II ^i-^j ' -^J-^i \\f ■ 

Theorem 1: The map defined by (|5]) is a quadratic form. 
Proof: The map Q needs to satisfy the conditions as 
defined in Definition |2] We have 



Q(aX) 



E 

^,3 



CLX^.CIX j .dij — CL ^ ^ Xi.Xj.dij — CI Q (^) 



and 



B (X, Y) - - [Q (X + Y) - (X) - Q (Y)] 



should be bilinear and symmetric where X = X^fLi XiAi and 



Y = X^iLi iUi-^i)- Substituting and simplifying, we get 

i? (X, Y) = i ^ [xiUidu + {xiUj + XjUi) dij] . 



It is clearly seen that this map is bilinear and symmetric. ■ 
We can associate a matrix with the HRQF. If we define the 
matrix M = {rriij) where i,j = 1,2,...,K such that rriij = 
dij, then we can write the HRQF as Q (x) ~ xMx^, where 
X = [xi X2 ■■■ xk]- Notice that M is a symmetric matrix and 



if and only if A^A 



H 



-A,Af =0. 



The following example shows that the PSD complexity 
depends on the ordering of the weight matrices or equivalently 
the ordering of the variables. 

Example 1: Let us consider the Silver code given by: 



X = Xa (Sl, S2) + TXf, (Zi, Z2) 



(6) 



where Xa and X;, take the Alamouti structure, and 



Xq (si,S2) 



51 -S2 

52 Si 



, Xb (2:1,22) 







and [zi, Z2] 
3sen to 
and is given by U = 



U[s3,S4] , where U is a 



1 

-1 

unitary matrix chosen to maximize the minimum determinant 

1+j -1 + 2.? ^ 

_ 1 + 2j 1-3 
Let all the variables take values from a signal set of 
cardinality AI. If we order the variables (and hence the weight 
matrices) as [su, sig, S2/, S2q, S3/, ssq, S4/, S4q], then the R 
matrix for SD has the following structure 



where t denotes non zero entries. We can clearly see that 
the Silver code admits fast decoding with this ordering with 
PSD complexity M^. However, if we change the ordering to 
[si/,siq,S4/,S2Q,S3/,S3Q,S2/,S4q], then the R matrix for 
SD has the following structure 



where t denotes non zero entries. With this ordering, the PSD 
complexity increases to Af^. 

The contributions of this paper are as follows: 

• We give a formal definition of the PSD complexity of a 
linear STBC (Subsection HLBI ) 

• With the help of HRQP, it is shown that the PSD com- 
plexity of the code depends only on the weight matrices 
of the code with their ordering, and not on the channel 
realization (even though the equivalent channel when SD 
is used depends on the channel realization) or the number 
of receive antennas. 

• A best ordering (not necessarily unique) of the weight 
matrices provides the least PSD complexity for the STBC. 
We provide an algorithm to be applied to the HRQP 
matrix which outputs a best ordering. 

The remaining of the paper is organized as follows: In 
Section |ll] the known classes of low ML decodable codes, 
the system model and the formal definition of the PSD 
complexity of a linear STBC are given. In Section |III1 we 
show that the PSD complexity depends completely on the 
HRQP and not on the channel realization or the number of 
receive antennas. In Section |IV] we present an algorithm to 
modify the HRQP matrix in order to obtain a best ordering 
of the weight matrices to obtain the least PSD complexity. 
Concluding remarks constitute Section [V] 

Notations: Throughout the paper, bold lower-case letters are 
used to denote vectors and bold upper-case letters to denote 
matrices. Por a complex variable x, xi and xq denote the real 
and imaginary part of x, respectively. The sets of all integers, 
all real and complex numbers are denoted by Z, M and C, 
respectively. The operation of stacking the columns of X one 
below the other is denoted by vec (X). The Kronecker product 
is denoted by (8), It and Ot denote the T xT identity matrix 
and the null matrix, respectively. Por a complex variable x, 
the (.) operator acting on x is defined as follows 



XI 



-XQ 
XI 



The (.) operator can similarly be applied to any ma- 
trix X e ([^nxm replacing each entry Xij by 
, i = 1,2, ■■■ ,n,j = 1,2, • • ■ , m, resulting in 
a matrix denoted by X G K2nx2m Qjven a complex 
vector X = [xi,X2,--' T^n}^ , x is defined as x = 

[XU,X1Q, ■ ■ ■ ,XnI,XnQ]^ ■ 



II. System Model and Dehnition of FSD 

COMPLEXITY 

For any Linear STBC with variables xi,X2---, xk given by 
(O, the generator matrix G [3] is defined by vec (X) = Gx, 
where x = [xi,X2.-.,xk\'" ■ In terms of the weight matrices, 
the generator matrix can be written as 



G= wee (Ai) wee (A2) ■•• wee(A/^) 
Hence, for any STBC, ([B can be written as 
wee (Y) — HeqX + vec (N), 



where Heq S 



2n,-ntXK gjygjj jj^^ ^ (-j^^ ®H)G, 

and X = [xi, 2:2..., xjf] , with each x.; drawn from a 1- 
dimensional (PAM) constellation. Using the above equivalent 
system model, the ML decoding metric (|2]i can be written as 



M(x) =11 wee(Y) 



HggX \\p 



Using QR decomposition of He^, we get Hgg = QR where 
Q g ]g2n,,,itxK orthonormal matrix and R S M^^-^^ 

is an upper triangular matrix. Using this, the ML decoding 
metric now changes to 



M (x) =11 Q^wee (Y) - Ri \\l=\\ y - Ri I|| 



(7) 



If we have He^ = [hih2..., h/^] , where hi,i G \,2,...,K are 
column vectors, then the Q and R matrices have the following 
form obtained by the Gram-Schmidt orthogonalization: 



Q = [Qi q2 - Qx] ) 

where q^, i G 1, 2, A' are column vectors, and 



(8) 



R 



ri II (qi,h2) (qi,h3) 
I|r2|| (q2,h3) 
II rg II 



(qi,hK) 
(q2.hif) 
(qs'hi^) 



(9) 



where ri = hi, ({^ — lyp^iy and for i = 2, .../C, 



Ti =h, -^(q^-,hi)q^-, q,, 



A. Multi-group decodability, fast decodability and fast group 
decodability 

In case of a multi-group decodable STBC, the variables can 
be partitioned into groups such that the ML decoding metric 
is decoupled into submetrics such that only the members of 
the same group need to be decoded jointly. It can be formally 
defined as [9], [16], [17]: 

Definition 4: An STBC is said to be g-group decodable 
if there exists a partition of {1,2,...,!^} into g non-empty 
subsets ri,r2,...,rg such that the following condition is 
satisfied: 



A,A 



H 



H 



0, 



whenever I G Vi and m G Vj and i ^ j. 
If we group all the variables of the same group together in (|7]), 
then the R matrix for the SD [2], [18] in case of multi-group 
decodable codes will be of the following form: 



R 



Ai 
A2 










(10) 



where Aj, i = 1, 2, g is a square upper triangular matrix. 

Now, consider the standard SD of an STBC. Suppose the R 
matrix as defined in ^ turns out to be such that when we fix 
values for a set of symbols, the rest of the symbols become 
group decodable, then the code is said to be fast decodable. 
Formally, it is defined as follows: 

Definition 5: An STBC is said to be fast SD if there exists 
a partition of {1,2,...,L} where L < K into g non-empty 
subsets ri,r2,...,rg such that the following condition is 
satisfied 

(q„h,)=0(^<J), (11) 

whenever i G Fp and j G Tg and p ^ q where q^ and are 
obtained from the QR decomposition of the equivalent channel 
matrix He, = [hih2..., Ha'] = QR with hi,i G 1,2,..., K as 
column vectors and Q = [q^ qj ... q^:] with q^, i G 1,2, K 
as column vectors as defined in (H)). 

Hence, by conditioning K — L variables, the code becomes 
g-group decodable. As a special case, when no conditioning 
is needed, i.e., L = K, then the code is g-group decodable. 
The R matrix for fast decodable codes will have the following 
form: 

' A Bi 
B2 



R 



(12) 



where A is an L x L block diagonal, upper triangular matrix, 
B2 is a square upper triangular matrix and Bi is a rectangular 
matrix. 

Fast group decodable codes were introduced in [14]. These 
codes combine the properties of multi-group decodable codes 
and the fast decodable codes. These codes allow each of the 
groups in the multi-group decodable codes to be fast decoded. 
The R matrix for a fast group decodable code will have the 



following form: 



R 



Ri 
R2 










R„ 



(13) 



where each Ri 



1,2,... 



R, 



g will have the following form: 
■ A,: B,:, 



B, 



(14) 



where is an L; x Li block diagonal, upper triangular 
matrix, B^^ is a square upper triangular matrix and B^^ is 
a rectangular matrix. The structure of the R matrix for each 
of the codes defined above depends upon the ordering of the 
weight matrices. If we change the ordering of the weight 
matrices, the R matrix may lose its structure and no longer 
exhibit the desirable decoding properties. The Silver code of 
Example [T] illustrates this aspect. In the following subsection, 
we give a formal definition of the FSD complexity. 

B. FSD Complexity of an STBC 

In this section we define the FSD complexity of an STBC. 
First we consider a single group decodable case and define 
the FSD complexity for a particular ordering. We then extend 
the definition to a multi-group decodable STBC. Let C be 
an STBC with the weight matrices Ai, Ai^, where all the 
variables take values from a signal set of cardinality M . Let 
R be the matrix obtained by the QR decomposition used for 
SD. Let the ordering of the weight matrices used to obtain the 
R matrix be Ai, A^j- . Denote by li the number of variables 
that need to be conditioned, when we use FSD on the R matrix. 
After conditioning li variables, let the rest of the variables 
be pi -group decodable. In case the code is not fast sphere 
decodable, then we set li to the total number of variables 
in the matrix. Denote by R^^ the submatrix containing the 
variables of the ii-th group after conditioning li variables in 
the R matrix and by rii^ the number of variables in the ii- 
th group, where 1 < ii < pi- We can use FSD on each of 
the Rij matrices now. Let us denote by the number of 
variables that need to be conditioned, when we use FSD on 
the Rij matrix. After conditioning ^i ^^ variables in R^^, let 
the rest of the variables be pi.^^ -group decodable. Let p2 = 
maxjj {pi.il)- Denote by R^^^^a submatrix containing the 
variables of the i2-th group after conditioning li,i^ variables in 
the Rij matrix and by Ui^.i^ the number of variables in the 12- 
th group, where 1 < 12 < P2- It may so happen that pi < p2 
for some ii. In such cases we set rii^.i^ = for 12 > Pi.ii- We 
continue this process till we cannot apply FSD any more. This 
process terminates since there are a finite number of variables 
and the number of variables are decreasing with each iteration 
of the FSD. Let the process stop after A such iterations. In the 
(A — 1) -th iteration, we have the following: Let us denote 

by ^i,ii,i2 ix-i number of variables that need to be 

conditioned, when we use FSD on the Rij^ij i^_i matrix. 

After conditioning li.ii.i2,...,ix-i variables in R^^^^a ja-i' 

the rest of the variables be pi^i-^.i^ i;^_j-group decodable. Let 



px = maxii_i2^...^j^_j {pi.n,i2,...,ix-i)- Denote by Rii,^^,...,^^ 
the submatrix containing the variables of the i\-th group 

after conditioning /i.ii.ia ix-i variables in the Ri^.i^ ix-i 

matrix and by rii-^^i^ i^ the number of variables in the ix- 

th group where I < ix < P\- On the last iteration, we have 

h,n,i2,...,ix = nii,i2....4x for 1 < *A < Pa- 

Let ~ maxi^. (^Ii.ij^.i2,....ij + fci.ii,i2,...,i3) for 

2 < j < A, 1 < ij < pj and ki^i-^^ir^^,,,^i^ ~ and fci = 

Definition 6: We define the FSD complexity of a single 
group decodable STBC for the given ordering to be 

In case of a multi-group decodable code with g groups, the 
R matrix will be a block diagonal matrix. We then calculate 
the FSD complexity of each group independently as described 
above and choose the maximum among them as the FSD 
complexity of the STBC. 

Definition 7: We define the FSD complexity of a multi- 
group decodable STBC with g groups to be max; (^A/'*"*''^'*) , 
where 1 < i < g. 

We present a few examples to get a better understanding of 

FSD complexity. 

Example 2: Let the R matrix be of the form: 



ail 


ai2 








ai5 


ai6 


an 


CllS 





a.22 








025 


028 


027 


£128 








a.33 


a.3i 


035 


036 


037 


asa 











044 


045 


046 


047 


0.48 














055 


























166 


























077 


























ass 



Now if we use FSD on this matrix, we will condition h = 4 
variables and obtain a 2-group decodable code. We have 



Ri 



an 




ai2 
022 



R 



2 — 



033 




034 
044 



Since the number of variables in each of the above matrices 
are 2, we have ni = n2 = 2. We cannot condition any more 
variables in either Ri or R2. So the process stops here and 
we set li^i = 2 and Zi,2 = 2. We have 

h.i = ki,2 = 0, 
ki = max (^14 + /i,2 + ^1,2) = 2. 
The FSD complexity of this STBC for the given ordering is 

Example 3: Let the R matrix be of the form: 





" ai,l 





ai,3 


ai,4 














t 


t - 







02,2 


12,3 


0-2,4 
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t 










03,3 


03.4 














t 


t 













04,4 














t 


t 


R = 


















05.5 





16,6 






as.s 

£16, s 


t 
t 


t 
t 






















£17.7 


£17,8 


t 


t 

























£18,8 


t 


t 




























t 


t 




. 


























t . 



Now if we use FSD on this matrix, we will condition li 
variables and obtain a 2-group decodable code. We have 

ai,i ai^3 aiA 

02,2 02,3 a2A 

03,3 03,4 

044 



Ri 



05,5 05,8 

D _ a6,6 a6,8 
U U 07^7 07^8 
a8'8 

Since the number of variables in each of the above matrices are 
4, we have ?ii = 71,2 = 4. Now if we use FSD on Ri, we can 
condition lij = 2 variables and obtain a 2-group decodable 
code. If we use FSD on R2, we can condition 2 ~ 1 variable 
and obtain a 3-group decodable code. We now have 

Ri,i = [ ] , Ri,2 = [ 02.2 ] , 

R2,l [ 05,5 ] , R2,2 [ 06,6 ] , R2,3 = [ O7J ] , 

"1,1 = "-1,2 = "2,1 = "2,2 = "2,3 = 1 and ni^3 = 0. 

We cannot condition any more variables in any of these 
matrices. So the process stops here and we set 

^1,1,1 — h,i,2 — '1,2,1 ~ '1,2,2 — '1,2,3 — 1- 

We have 

fcl.1,1 = fcl,l,2 = fcl,2,l = fcl,2,2 = fcl,2.3 = 0, 

ki^i = max (^1,1,1 + fci, 1,1, '1,1,2 + '^1,1,2) = 1, 
fci,2 = max (/i_2,i + fci, 2,1, '1,2.2 + fci,2,2, '1,2.3 + ^1,2,3) = 1, 
ki = max (^1.1 + kis,li,2 + ^1,2) = 3. 

The FSD complexity of this STBC for the given ordering 

is A'f'i+'^i = A/5. 

III. HRQF AND FSD COMPLEXITY 

In this section we show that the HRQF matrix is enough to 
determine the FSD complexity of an STBC and hence the FSD 
complexity is independent of the channel matrix realization 
or the number of receive antennas. Towards this end, we 
prove that the zeros in the R matrix which determine the FSD 
complexity are also zeros in the HRQF matrix. First we define 
an ordered partition of a set. 

Definitions.- We call a partition of {ai, 02, Oif } 
into g non-empty subsets ri,r2,...,rg with cardinalities 
Ki,K2,-.-,Kg an ordered partition if {ai,...,aKi} G 
Fi, {fl/fj+i, a^^+Ka} e r2 and so on, till 

Now we address the class of multi-group decodable codes. 

Lemma 1: Consider an STBC C ~ SiLi ^i^i- Let M 
denote the HRQF matrix of this STBC. If there exists an 
ordered partition of {1, 2, A'} into g non-empty subsets 
Fi, r2, Fg such that = whenever i G Fp and j S Fg 
and p q, then the code is g-group sphere decodable. In other 
words, the FSD complexity of the STBC is determined by the 
HRQF matrix. 

Proof: Let R be the matrix obtained from the QR 
decomposition of Hgg. For the code to be gi-group sphere 
decodable, we need to prove that = 0, whenever i G Fp and 
j e F, and py^q.We know from [12] that if A.,Af +AjAf = 
is satisfied for some i, j then the corresponding columns in 
the Heg matrix are orthogonal, i.e., {hi,hj) = 0. We also 



H 



AjAf = 0. Let 



know that = if and only if A^A 
Lp = Yfq=i iTql where 1,2,..., g and Lq = 0. 

For any group Fp, we need to prove that rij = for Lp-i + 
1 < i < Lp and Lp + 1 < j < K. Consider the first group Fi. 
We have m,j = for 1 < i < ii and ii + 1 < j < i^T. We 
need to prove that the R matrix has zero entries at the same 
locations. The proof for this is by induction. 
For i ~ 1 and for any j > Li + 1, 

since = jjijhi. Now, let (Q;, h^) = for alH < i for any 
i such that 1 < i < ii. We have. 



1 



E 

1=1 



{qi,h,)qi,hj) 



(h„hj) -^(q,,h,)(q,,hj) 



1=1 



= 0, 



since {hi,hj) = as niij ~ and (q;,hj) = for I < i, hy 
induction hypothesis. 

Now consider the p-th group Fp. Let the induction hypoth- 
esis be true for all groups l,2,...p — 1. Consider where 

ip_i + I < i < Lp and Lp + 1 < j < K. We have. 



1 



1 



0, 



since (hi,hj) = as rriij = and (q;,hj) = for / < i by 
the induction hypothesis. ■ 

We now consider an example to illustrate the above lemma. 

Example 4: Consider the 2 x 2 ABBA code given by [19]: 



-X2+ jX3 



-X2 +JX3 
Xl + jX4 



where xi G E for i = 1, 2, 3, 4. This is a two group decodable 
code with {xi,X2} belonging to one group and {x3,X4} 
belonging to the other. The structure of the HRQF matrix M 
and the R matrix are given below with [xi, X2,X3, X4] as the 
ordering of the variables and the weight matrices. 



M 



t t 

t t 

t t 

t t 



R 



t t 

t 

t t 

i 



where t denotes the non-zero entries. As it can be seen, the 
upper triangular portion of M matrix and the R matrix have 
the same structure. 

Now we move on the class of fast decodable codes. 

Lemma 2: Consider an STBC C ~ 'l2iLi^i-^i- Let M 
denote the HRQF matrix of this STBC. If there exists an 
ordered partition of {l,2,...,i} where L < K into g non- 
empty subsets ri,r2,...,rg such that = whenever 



i £Tp and j G Tq and p ^ q, then the code is fast decodable 
or conditionally g-group decodable. 

Proof: The proof follows from the proof of Lemma [T] by 
replacing K with L. ■ 
We now consider an example to illustrate the above lemma. 

Example 5: Consider the Silver code as mentioned 
in Example [U If we order the variables (and 
hence the weight matrices) in the following fashion 
[si/, siQ, S2/, S2Q, S31, S3Q, S47, S4q], then the HRQF matrix 
M and the R matrix will have the following structure: 

'toootttt' 
otootttt 
oototttt 
_ ooottttt 

t t t t t ' 

ttttotoo 
ttttooto 
ttttooot 

'toootttt' 
otootttt 
oototttt 
ooottttt 

OOOOtOOO ' 
t 
i 
t 

where t denotes the non-zero entries. As it can be seen, the 
upper triangular portion of the matrix M, has a structure 
that admits fast decodability which is conditionally 4-group 
decodable if considered as the R matrix. 

We now turn to the class of fast group decodable codes. 

Lemma 3: Consider an STBC C = X^ili^j^j- Let M 
denote the HRQF matrix of this STBC. If there exists an 
ordered partition of {1,2,..., A'} into g non-empty sub- 
sets Fi, Fg with cardinalities Ki, K2, Kg such that 
rriij = whenever i E Tp and j G Tq and p ^ q, and if any 
group Ti admits fast decodability, i.e., there exists an ordered 

partition of {^/Ii Ki + 1, Y.lZl Ki + 2, E/=i Ki + L^] 
where L; < Ki, into non-empty subsets , T^j, 
such that rrirs = whenever r G and s G Tj and p ^ q, 
i = 1,2, g, then the code is fast group decodable. 

Proof: The proof follows from the proofs of lemmas [T] 
andlH ■ 
We now consider an example to illustrate the above lemma. 

Example 6: Consider the fast group decodable STBC [14] 
given in ( fTST i . 

Let the ordering of the variables (and hence the weight 
matrices) be [si,S2, sij]. This STBC is two group decod- 
able with si in one group and {s2, s^, sn} in the other 
The second group is conditionally five group decodable. The 
HRQF matrix M and the R matrix are given in ( fTST i and (fTTI i 
respectively, where t denotes the non zero entries. 
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complexity of the STBC depends only upon the HRQF matrix 
M and not on the Heg matrix, i.e., the FSD complexity is 
independent of the channel matrix and the number of receive 
antennas. It can be completely captured into a single matrix 
obtained from the set of weight matrices and their ordering. 

IV. Algorithm for a Best Ordering of the Weight 
Matrices 

As seen in Example [T] the ordering of weight matrices 
determines the FSD complexity of an STBC. We have also 
seen that the HRQF matrix completely determines the FSD 
complexity of an STBC. In this section we present an algo- 
rithm that uses the HRQF matrix as an input and manipulates it 
in order to obtain a best possible ordering of weight matrices. 
We do so by using row and column permutations of the 
HRQF matrix. The rows and columns of the HRQF matrix 
are in one to one correspondence with the ordering of the 
weight matrices. Hence, if we change the ordering of the 
weight matrices, the HRQF matrix changes accordingly and 
vice verse. For example, any transposition in the ordering of 
the weight matrices will result in swapping the corresponding 
rows and columns (since HRQF matrix is symmetric) of the 
HRQF matrix. 

Remark 1: Note that we cannot perform such a manipula- 
tion on the R matrix since it depends not only on the order of 
weight matrices but on the channel matrix as well. Also, all the 
entries of the R matrix do not depict the HR orthogonality of 
the weight matrices, i.e., the (i,j)-th entry of the matrix may 
not be zero even if the i-th and j-th weight matrices are HR 
orthogonal. Hence, the R matrix needs to be calculated each 



+ jS2 + jsi5 + jsie + jsi7 
~S7 + jss - Si3 + JS14 
-S3 + jS4 - Sii + JS12 
S5 - jS6 - Sg + jsio 



S7 + jS8 + Si3 + jsi4 
Si + jS2 + JSis - jsie - jsi7 
— Ss — jse — Sg — jsio 
-S3 - jS4 + Sii + jsi2 



S3 + iS4 + Sii + jsi2 
S5 - jsa + Sg - jsio 

- is2 + isi5 - jsie + jsn 

-S7 - jss + Si3 + JS14 



— ss — jse + Sg + jsio 

S3 — iS4 — Sii + jsi2 
S7 — jS8 — Sl3 + jsi4 
- jS2 + JSis + JS16 - jsi7 



(15) 



time the ordering of the weight matrices is changed which is 
not so in the case of the HRQF matrix. 

The algorithm to get a best possible ordering is given in 
Algorithm [T] in the next page. 

An important structural property of the HRQF matrix: 
Before we delve into the proof of correctness of the algorithm, 
let us make a few observations regarding the structure of the 
HRQF matrix for various scenarios of decoding. Let C be 
an STBC with weight matrices Ai, Ax and corresponding 
variables xi, a;x. The code C can be multi-group decod- 
able, fast-decodable, fast group decodable or none of these. 
The structure of the upper triangular portion of the HRQF 
matrix for the first three of these are shown in the equations 
([Tol l. (fT2] i and ( fT3] l respectively. In case the code does not allow 
any of these forms of decoding, the ordering of the variables is 
immaterial. Let A be a best ordering of the variables. Without 
loss of generality, let the first variable in this ordering be 
labelled xi. Irrespective of the type of decoding provided by 
the code, we notice that all the variables that need to be jointly 
decoded with xi are adjacent to it in the HRQF matrix. These 
are followed by the set of zeros which indicate all the variables 
which are Hurwitz-Radon orthogonal with the previous set. 
These are further followed by the variables that are to be 
conditioned in order to obtain this group-decodable structure. 
In case of multi-group decodable codes, this is a null set. 

Proof of Correctness: We need to show that the algorithm 
produces a best possible ordering for FSD complexity. The 
algorithm uses the structural property of the HRQF matrix 
for obtaining a best ordering. Given any ordering A, with the 
first variable as xi, the algorithm partitions the variables into 
three sets - Ai,A2 and A3. In case of multi-group decoding 
and fast group decoding, Ai contains all the variables that 
belong to the same group as xi. In case of fast decoding, it 
represents the set of variables that need to be jointly decoded 
with xi after conditioning. A2 contains all the variables HR 
orthogonal with Ai. A3 contains the variables that need to be 
conditioned. This will be an empty set in case of multi-group 
decoding and hence multi-group decoding can be considered 
as fast decoding with no conditioned variables. The algorithm 
is recursively run on the sets Ai and A2 to further order the 
variables (as in case of fast group decoding). 

First, we fix the first variable in the given ordering. Let it 
be Xi for some 1 < « < K- The algorithm starts with only 
Xi in Ai, all variables which are HR orthogonal with Xi in 
a temporary set Af and the rest of the variables in A3. The 
variables HR orthogonal with Xi can be easily identified as 
they correspond to the zero entries in the first row. For ease of 
manipulation, we move all the variables in the set At adjacent 
Xi in the ordering. This is equivalent to grouping all the zeros 
in the first row and placing them adjacent to the (1,1) entry. 



as these denote all the variables HR orthogonal with Xi. The 
current ordering of variables is - {2;^, Ai, A3}. For the working 
of the algorithm, the following set operations are synonymous 
with the following matrix operations on the HRQF matrix: 

• Moving a variable Xj into Ai - Suppose the variable Xj 
is the p-th element in the current ordering, we move the 
p-th column to the (|Ai| + l)-th column shifting the rest 
of the columns to the right and then we move the p-th 
row to the (|Ai| + l)-th row shifting the rest of the rows 
downwards. Update the ordering accordingly. 

• Moving a variable Xj into A3 - Suppose the variable Xj 
is the p-th element in the current ordering, we move the 
p-th column to the last column shifting the rest of the 
columns to the left and then we move the p-th row to the 
last row shifting the rest of the rows upwards. Update the 
ordering accordingly. 

• Moving a variable Xj into A2 - No change. Only the 
cardinalities of A2 and At will change accordingly. 

The algorithm works in two stages. 

. First, we find the largest L < K such that |Ai| + IA2I = 
L. 

• In the second stage, since the variables in Ai and A2 
will be decoded separately, we consider the submatrices 
representing them as HRQF matrices of some STBC and 
run the first step of the algorithm on them recursively. 

Let Xk be the first variable in At. It is currently the second 
variable in the overall ordering. We now proceed to find all 
the variables that are HR orthogonal with Xk and not HR 
orthogonal with Ai, since these will need to be jointly decoded 
with the variables in Ai. This can be found as follows. If 
any variable is HR orthogonal with Xk, it will have a zero 
entry in the column corresponding to x^. Hence we traverse 
down the column represented by Xk to find the next zero 
entry. Let it be found in the p-th row corresponding to the 
variable xi. We also need to ensure that this variable is not 
HR orthogonal with Ai. So, we traverse the p-th row from 
column 1 to column |Ai| and check for any non-zero entries. 
In case any of them are found, it means that this variable 
needs to be jointly decoded with Ai. We add xi to Ai. We 
now repeat the procedure on the variable Xk again until all the 
variables have been exhausted. Since there are a finite number 
of variables, this process terminates. Note that it is possible 
that a few members from At may move into Ai during this 
process. These variables will be accounted for later. We now 
add Xk to A2. We have now managed to create two sets which 
are HR orthogonal and the current ordering of the variables is 
{Ai,A2, At, A3}. 

We now proceed to the next variable in At. Let this be Xj. 
We need to ensure that Xj is HR orthogonal with Ai. This 
can be easily checked by counting the consecutive number 



Input: The HRQF matrix - M = (mij), the size of the HRQF matiix - 

K X K, the input ordering - input jDrdering 
Output: The best possible FSD complexity - best_dec_cmplxty and 
its corresponding ordering - best jar dering 

- current_ordering = inputjyrdering 

- best.dec_cmplxty = K; i = I 
repeat 

- Shift all the zero entries in the first row next to the first element 

- Let the number of zeros in the first row be num^erojo.ols 

- grp_size = 1; cur_zero_col = 2. (Vai' under consideration) 
if num.zero.cols = then 

I - dec.cmplxty = K 



end 
else 



repeat 

- flag = 

- Let no. of consecutive zeros in cur^zerajzol be n. 
if n < grp.size then 

- Move the cur_zero_col-ih variable to the end 
(Move the corresponding row and column to the end 
and moving the rest of the rows and columns 
upwards). 

- Update currentjordering. 

- nuni^zerajzols = num^erojcols — 1 

end 

if n > cur^eTo.col — 1 then 

- grp_size = cur_zero_col — 1 
Marker: 

- Find the next zero along the cur_zero_col column 
from the cur^ero.col row. 

- Let the next zero be found in the j-th row. 
for t = 1 to grp^size do 

if m,jf 7^ then 
I - flag = 1 
end 

end 

if flag = 1 then 

- Move up the j-th variable to the 
grp^size + 1-th position. - Update 
current jyrdering. 

- grp^size = grp^size + 1 

- cur_zero_col = cur_zero_col + 1 

- jump back to Marker 

end 

if flag = then 
I - cur.zero.col = cur.zero.col + 1 (No more 
I zeros left in this column to check) 
end 

end 

until cur.zero.col < grp.size + num.zero.cols ; 

- topjirgf. matrix = upper left grp.size X grp.size 
matrix. (With ordering top Jirgf .ordering). 

- bot.hrqf .matrix = squai'e matrix from row, column = 
grp.size + 1 to row, column = grp.size + num.zero.cols. 
(With ordering bot.hrqf .ordering). 
Run the current algorithm on the top and bottom matrices 

- [top.dec.cmplxity, best.top.ordering] = 
order Jirqf {top Jirqf .matrix , top.hrqf.size, top.ordering) 

- [bot.dec.cmplxity , best.bot.ordering] = 
order Jirqf {bot.hr qf .matrix, bot Jirqf .size, bot.ordering) 

- Update current.ordering with best.top.ordering and 
best.bot.ordering . 

- Number of variables conditioned: 
cond.vars = K — grp.size — num.zero.cols. 

- dec.cmplxty = 

cond.vars + max {top.dec.cmplxity , bot.dec.cmplxity} 

end 

if best.deccmplxty > dec.cmplxty then 
I - best. dec.cmplxty = dec.cmplxty 
I - best.ordering = current.ordering 
end 

- Circularly shift the variables (rows and columns) 

- Update current.ordering 

- i = i + l 
until i = K ; 

Algorithm 1: The algorithm to obtain a best ordering of 
weight matrices - order. hrqf 



of zeros in its column from the top of the column. If it is 
equal to |Ai|, then Xj is HR orthogonal with Ai. We add Xj 
to A2 in this case. Otherwise, we move Xj to A3. It may so 
happen that Xj may be HR orthogonal with Ai and the already 
existing members of A2. This can be checked by counting the 
consecutive numbers of zeros in its column from the top of the 
column and matching cardinalities. In such a case, we repeat 
the procedure of finding any variable that needs to be jointly 
decoded with Ai on this variable as well. If any such variable 
exists, we move that variable and the pre-existing variables of 
A2 into Ai. This is needed for fast group decoding scenarios. 
We continue this procedure till we have exhausted all the 
variables of At. We have now produced three sets Ai, A2 
and A3. Ai and A2 are HR orthogonal w.r.t. one another if we 
condition them on the variables of A3. 

We still need to process Ai and A2 further We repeat 
the same procedure on these variables considering them as 
independent HRQF matrices. If the cardinality of Ai is the 
same as that of the HRQF matrix row/column size, then we 
stop this procedure as it cannot be ordered any further and 
set the FSD complexity to |Ai|. This process terminates as 
the sizes of Ai and A2 keep reducing with each iteration and 
there are a finite number of variables. We calculate the FSD 
complexity of this ordering as - IA3I +max (rfi, ^2) where di 
and d2 are the FSD complexities of Ai and A2 respectively. 

Since we are dealing with fast sphere decoding and grouping 
the variables as sets Ai, A2 and A3 are the only possible ways 
to obtain a structure that will allow FS decoding, these are all 
the orderings we need to consider if we take the first 

variable. Hence, we have exhausted all the possible orderings 
which can provide FS decoding with the first variable, 

and a best possible FSD complexity for this case has been 
found by the above steps. Now, if we run through all the 
variables of the STBC making each one of them the first 
variable in turn and repeating the above procedure, we can find 
out a best possible FSD complexity for all possible orderings. 
Since all the variables have been given a chance to be the 
first variable, we have exhausted all possible orderings that 
offer FS decoding. And hence, the ordering provided by the 
algorithm is a best possible ordering for FSD complexity. ■ 

Remark 2: Since the algorithm recursively orders each set, 
it is capable of even ordering variables in scenarios where 
any group obtained from conditioning of variables admits 
conditional decoding. 

We now illustrate the working of the algorithm with an 
example. 

Example 7: Consider the Silver code presented 
in Example [T] If we order the variables as 
[su,Sii,S4g,S2Q,S3Q,S3i,S2i,siQ], wc get the following 
HRQF matrix and the R matrix for this ordering: 
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The FSD complexity for this ordering is M^. And this 
ordering does not admit fast decoding as well. 

When we run the algorithm on the given HRQF matrix, the 
two sets Ai and A2 are formed, which are HR orthogonal 
with each other In this case, A2 = {s2Q, S2/, sig} and 
Ai = {sii}. The conditioned variables will be present in 
the set A3 = {54/, S4Q, S3Q, S37}. The HRQF matrix at this 
stage is as given by ( fTSl l. The ordering of the variables at 
the end of this stage is [si/, S2q, S27, sig, S4/, S4Q, sag, S37]. 
Now, the variables from both sets Ai and A2 are run through 
the algorithm again. So, the top left 1x1 matrix and the next 
block diagonal 3x3 matrix are both fed to the algorithm. 
Since this is already the best possible ordering of these sets, 
the matrix M remains the same after this stage. And the fi- 
nal ordering obtained is [si/, S2q, S27, sig, S4/, S4Q, ssq, S3/]- 
The R matrix for this ordering is given by ( fT9] l 
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The FSD complexity for this R matrix is which is the 
best possible complexity for the Silver code. 

Example 8: Consider the fast group decodable code pre- 
sented in Example |6] If we order the variables as 
[s2,S3,---,sio,si,sii,si2,...,si7], we get the following 
HRQF matrix and the R matrix: 



M 



The FSD complexity for this R matrix is M^'^ but the best 
possible FSD complexity is Af When we run the algorithm 
on the given HRQF matrix, the two sets Ai and A2 are formed, 
which are HR orthogonal with each other. In this case, Ai = 

{S2, S3, S4, S5, S6, Sg, Sio, Sll, S12, S7, Ss, Si3, S14, S15, Siq} 

and A2 = {si}. The set A3 is empty as this provides a group 
decoding scenario. Now, the variables from both sets are 
run through the algorithm again. So, the top left 16 x 16 
matrix and the bottom right 1x1 matrix are both fed to the 
algorithm. The top left matrix is ordered according to the 
fast decoding algorithm as presented in the previous example. 
Since the bottom right matrix is a 1 x 1 matrix, it is returned 
without change. The final ordering of variables obtained is 

[S2, S3, S4, S5, Sq, S7, Ss, Si3, Si4, Si5, Si6, Sg, Sio, Sn, S12, Si]. 

The M and the R matrix for this ordering is as shown below. 
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This ordering gives us the FSD complexity of M 

V. Conclusion 

In this paper we have analysed the FSD complexity of an 
STBC using quadratic forms. We have shown that the HRQF 
completely categorizes the FSD complexity of an STBC and 
hence it is independent of the channel and the number of 
receive antennas. We have provided an algorithm to obtain 
a best ordering of weight matrices to get the best decoding 
performance from the code. 
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